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Abstract 
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1 Introduction 



It is well known to both complex analysts and combinatorialists that the problem 
of reverting a power series, i.e. solving f{z) = w, has an explicit solution known 
as the Lagrange (or Lagrange-Biirmann) inversion formula [3,7,16,18,34,42,47,48]. 
What seems to be less well known is that the more general implicit-function problem 
F{z, w) — also has a simple explicit solution, given by Yuzhakov [49] in 1975 (see 
also [2,4,50]). My purpose here is to give a slightly more general and flexible version 
of Yuzhakov's formula, and to show that its proof is an utterly trivial generalization of 
the standard proof of the Lagrange inversion formula. A formal-powcr-series version 
of the formula presented here appears also in [42, Exercise 5.59, pp. 99 and 148], 
but its importance for the implicit-function problem does not seem to be sufficiently 
stressed. 

Let us begin by recalling the Lagrange inversion formula: if f{z) — X^^i ^n-^" 
with oi 7^ (interpreted either as an analytic function or as a formal power series), 
then 

r'w-E^r-'i(^)". M 

where [C"']5'(C) = g^"'\0)/n\ denotes the coefficient of in the power series g{C)- 
More generally, if h{z) = X^^o^"-^"' ^® i^scve 

h{r\w)) = MO) + E ^ [C"^"'] ^'(0 (t^) ■ (1-2) 

Proofs of these formulae can be found in many books on analytic function theory 
[7,18,34,47] and enumerative combinatorics [3,16,42,48]. 

It is convenient to introduce the function (or formal power series) g{z) = z/ f{z); 
then the equation f{z) — w can be rewritten as z — g{z)w, and its solution z — 
(fi{w) — f~^{w) is given by the power series 

^-^ m 

m=l 

and 

oo ^ 

h{^{w)) = h{0) + E — [C'""']^'(05(C)™- (1-4) 

m=l 

There is also an alternate form 

oo 

h{ip{w)) = h{0) + E [C'^J/^IO b(C)'" - Cg'iOgiCr-'] ■ (i-S) 

m=l 

Consider now the more general problem of solving z = G{z, w), where G(0, 0) = 
and \{dG/dz){0, 0)| < 1. I shall prove here that its solution z — ip{w) is given by the 
function series 

oo 

^(w) = j2-[r'']G(c,wr. (1.6) 
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More generally, for any analytic function (or formal power series) H{z,w), we have 



H{^{w),w) = H{o,w) + Y,-[r-'] 



m=l 



dH{C,w) 



m 



(1.7) 



oo 



dC 



m=l 



(1.8) 



The formulae fll.6p - fll.8l) are manifestly generalizations of the Lagrange inversion 
formulae f ll.3l) -( fT31) . to which they reduce when G{z, w) = g{z)w and H{z, w) = h{z). 
It turns out that the proof of fll.6p -( fL8l) is, likewise, virtually a verbatim copy of the 
standard proof of fll.3p - fll.5p : the variables w simply "go for the ride". 

The problem of solving z = G{z, w) can alternatively be interpreted as a fixed- 
point problem for the family of maps z ^ G{z, w) parametrized by w. From this point 
of view, f ll.6p -f |L8l) are simply a function series giving the unique solution of this fixed- 
point problem under a suitable "Rouche-contraction" hypothesis [see hypothesis (c) 
of Theorem 12.31 below] . Once again, the variables w simply "go for the ride" . 

Before proving fll. 60 - 01. 8p . let us observe how these formulae solve the implicit- 
function problem F{z,w) = 0, where -F(0, 0) = and {dF/dz){0,0) = aio ^ 0. It 
suffices to choose any analytic function 7(2;, w) satisfying 7(0, 0) 7^ 0, and then define 



Clearly F{z,w) = is equivalent to z = G{z,w), at least locally in a neighborhood 
of {z,w) = (0,0). Then {dG/dz){0,0) = 1 -7(0,0)010; so for fra-fOD to be 
applicable, it suffices to arrange that |1 — 7(0, 0)aio| < 1, which can easily be done by 
a suitable choice of 7(0, 0) [namely, by choosing 7(0, 0) to lie in the open disc of radius 
l/|aio| centered at 1/aio]. I wish to stress that each such choice of a function 7 gives 
rise to a valid but different expansion fll.6l) - fll.8l) for the solution z = ^{w). Even in 
the special case of the inverse-function problem f{z) = w, this flexibility exists and is 
useful (see Example 14.21 below). One important class of choices has 7(0,0) = 1/aio, 
so that {dG/dz){0,0) = 0; this latter condition leads to a slight simpliflcation in the 
formulae (see Remark 12.41 below). This special class in turn contains two important 
subclasses: 

• Yuzhakov [49] takes 7 to be the constant function 1/aioQ 

• Alternatively, we can choose 7 so that 7(2,0) = z/F{z,0) (this deflnition can 
still be extended to w 7^ in many different ways). Then G{z,0) = 0, so that 
G{z, w) has an overall factor w. This leads to a further slight simpliflcation (see 
again Remark 12.41) . 

^ His subsequent generalization [50] [2, Theorem 20.2 and Proposition 20.4] in effect allows a 
preliminary change of variables z' = z — ^(w) for arbitrary ip satisfying V'(O) = 0. 



G{z,w) 



z — 7(z, w)F{z, w) . 



(1.9) 
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Conversely, the problem of solving z = G{z, w) is of course equivalent to the 
problem of solving F{z, w) = if we set 



F{z,w) 



= z — 



G{z,w) ; 



(1.10) 



and the condition {dF /dz){0^0) 7^ is satisfied whenever {dG/dz){0^0) ^ 1. So 
our parametrized fixed-point problem has exactly the same level of generality as the 
implicit-function problem. 

The plan of this paper is as follows: First I shall state and prove four versions of 
the formulae fll.6p - fll.8p : one in terms of analytic functions (Theorem 12.31) . and three 
in terms of formal power series (Theorems 13.51 13.61 and 13.81) . Then I shall give some 
examples and make some final remarks. 

Since this paper is aimed at a diverse audience of analysts and combinatorialists, 

1 have endeavored to give more detailed proofs than would otherwise be custom- 
ary. I apologize in advance to experts for occasionally boring them with elementary 
observations. 

2 Implicit function formula: Analytic version 

In the analytic version of fll.6p - (ll.8l) . the variable w simply "goes for the ride"; 
consequently, w need not be assumed to lie in C, but can lie in the multidimensional 
complex space C*^ or even in a general topological space W. I shall begin with a 
simple auxiliary result (Proposition 12. ip that clarifies the meaning of the hypotheses 
of Theorem 12. 3[ I denote open and closed discs in C by the notations = G 
C: \z\ < R} and Br = {z e C: \z\ < R}. 

Proposition 2.1. Let W be a topological space, let R> 0, and let G : D/j x W C 

be a jointly continuous function with the property that G{- ,w) is analytic on D/j for 
each fixed w G W . Suppose further that for some Wq G W we have G(0, Wq) = and 
\{dG/dz){0,wo)\ < 1. Then for all sufficiently small p > and e > there exists 
an open neighborhood Vp 3 wq such that \G{z,w)\ < (1 — e)|z| whenever \z\ = p and 
w G Vp. 

Proof. The function 



is analytic on D/j, hence continuous on D^; and by hypothesis |/(0)| < 1. It follows 
that for all sufficiently small p > and e > 0, we have 1/(^)1 < 1 — 2e whenever 
\z\ < p. We now use the following simple topological fact: 

Lemma 2.2. If F: X x Y ^ is continuous and X is compact, then the function 
g: Y defined by g{y) = sup F{x,y) is continuous. 




ifO<\z\ <R 



(2.1) 
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Applying this with X = {z & C: \z\ = p} and y = V^, we conclude that there exists 
an open neighborhood Vp 3 Wq such that \G{z,w)/z\ < 1 — e whenever \z\ = p and 
w eVp. □ 

For completeness, let us give a proof of the lemma: 

Proof of Lemma 12.21 First of all, the compactness of X guarantees that g is 
everywhere finite. Now, since the supremum of any family of continuous functions 
is lower semicontinuous, it suffices to prove that g is upper semicontinuous, i.e. that 
for any z/q E Y and any e > there exists an open neighborhood V 3 yo such that 
9{y) < 9{yo) + f foi' all y E V. To see this, first choose, for each x E X, open 
neighborhoods 3 x and 3 yo such that F{x',y) < F{x^yo) + e < g{yo) + e 
whenever x' G and y G V^. By compactness, there exists a finite set {xi, . . . ,Xn} 

n 

such that {Uxi}i=i covers X. Setting V = f]Vx^ gives the required neighborhood. 

1=1 

□ 

It is easy to see that the conclusion of Lemma [2^ need not hold if X is noncompact. 
For instance, with X = M and Y = [0, 1], take F{x, y) = xy or F{x, y) = tanh(x?/). 
We can now state the principal result of this section: 

Theorem 2.3 (Implicit function formula — analytic version). 

Let V be a topological space, let p > 0, and let G,H: I])p x V ^ C be functions 
satisfying 

(a) G, dG/dz and H are jointly continuous on Dp x V; 

(b) G{- ,w) and H{ ■ , w) are analytic on Dp for each fixed w G V; and 

(c) |G(-2, w)\ < \z\ whenever \z\ = p and w E V. 

Then for each w E V, there exists a unique z G Dp satisfying z = G{z,w). Further- 
more, this z = <f{w) depends continuously on w and is given explicitly by the function 
series 

H{^{w),w) = H{0,w) + f^-[r-']^^%^G{Cwr (2.2) 



m dC 

m=l ' 

= H{0,w) + f2[nHiC,w)[GiC,wr - C^^^f^GiCw)" 

m=l 

(2.3) 

which are absolutely convergent on V, locally uniformly on 

^ "Locally uniformly" means that for each w E V there exists an open neighborhood U 3 w on 
which the convergence is uniform. This implies, in particular, that the convergence is uniform on 
compact subsets of V (and is equivalent to it if V is locally compact). 
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//, in addition, V is an open subset of C'^^ and G is analytic on Dp x V, then (p 
is analytic on V . 

Finally, if V is a polydisc (or more generally a complete Reinhardt domain) cen- 
tered at ^ C^'^ , and G and H are analytic onBip x V and satisfy G{0, 0) = 0, then 
(p{w) is also given by the Taylor series 

H{ip{w),w) = J2 C"^" (2.4) 

which is absolutely convergent on V, uniformly on compact subsets of V ; here the 
coefficients Ca are given by the absolutely convergent sums 

c„ = [w'^]H{o,w) + f2-[r-'n,n ^^^^;''^ G{c,wr (2.5) 



m=l 



K]iJ(0,^) + Y,[CV]H{C,w)[G{(,wr - C^^^f^GiCw'^-' 



(2.6) 



Remark 2.4. In many cases (12.51) is actually a finite sum. For instance, if (dG/dz) (0, 0) = 
— as occurs in particular in Yuzhakov's [2,49,50] approach — then each factor of 
G{(, w) brings either ("^ or w (at least); simple algebra then shows that the summand 
in (12.51) is nonvanishing only for m < 2\a\ — 1, where \ct\ = X]f=i ^^il^ Under the 
stronger hypothesis G{z,0) = 0, each factor of G{(,w) brings at least one w, so the 
summand is nonvanishing only for m < \cx.\. 

Analogous comments hold for (12. 6p . where the conditions are m < 2\a.\ and m < 
|a|, respectively. □ 



As previously stated, the proof of Theorem l2.3l is a trivial modification of the stan- 
dard textbook proof of the Lagrange inversion formula [7,34,47], but for completeness 
let us give it in detail. 

Proof. Hypotheses (a)-(c) combined with Rouche's theorem imply that for each 
w E V, the number of roots (including multiplicity) of ^ — G{z, w) = in the disc 
\z\ < p is the same as the number of roots of z = in this disc, namely one; so let 
us call this unique (and simple) root z = ^(w)^ It follows that for each w & V, the 
function 

H{C,w) ; (2.7) 



^ Yuzhakov [49, Proposition 1] [2, Proposition 20.4] writes m < 2\a\, which is correct but not as 
strong as it should be. 

It is well known (and is easily proved using Rouche's theorem and Lemma 12. 2p that this root 
depends continuously on w. This will also follow from the explicit formula (|2.2p / 
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is continuous on |^| < p and analytic in |^| < p except for a simple pole at C = 
with residue H(lp(w),w)^ Caucliy's integral formula therefore gives 



H{^{w),w) = ^ ^ H{C,w) 



1-f (C,^) 



\C\=P 



Let us now expand 



(-G{C,w) 



E 

m=0 



c 



dC. 



m+l 



(2.8) 



(2.9) 



and observe that this sum is absolutely convergent, uniformly for ( on the circle 
I CI = p of integration and locally uniformly for w & V [this follows from hypothesis 



(c) and Lemma [2.2j . We therefore have 



H{(p{w),w) 



y— 



r?i=0 



c 



m+l 



1 ^^(r- \ 



Kl=p 



By the Cauchy integral formula, this gives 



dG 



(2.10) 



H{^{w)M = 5^n//(C,^)G(C,«;)'" - $^[n^(C,^)^G'(C,«;) 



m=0 



m=0 



m=l 



m=l 



(2.11) 



This proves the alternate formula (11. 8p / (12. 3p , in which the sum is absolutely conver- 
gent on V, locally uniformly on V. 

To prove (II. 7p / (12. 2p . we start from (I2.10p and prepare an integration by parts: 

H{C,w) dC _ 1 H{C,w) d_^^^^^^ 



m+l (' 



m+l 



d( 1 g(C,^^) ^m+i^ _ d ( H{C,w) 



dC\m + l C™+i 



m + l dC\ C"+^ 



^ 1^ 1 -^(C,^^) (^m+A _ ^m+l ( 

dC\m + l C"+^ J \i 



(m + l)C™+i C™+2 

(2.12) 

where the prime denotes d/d(. Since the total derivative gives zero when integrated 
around a closed contour, we have 



°° 1 f 
H{^iw),w) = Y.^,f 

"^=° ICI=P 



^(C,«^)^m+l 



C 



m+l 



(m + l)("^+i 



■"m+2 



(2.13) 



It is in this step that we use the continuity of dG/dz (as well as that of G and H) on the closed 



disc 
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Now the first and third terms in brackets cancel when summed over m, except for the 
first term at m = 0, which gives simply (l/27ri) f[H{(,w)/(] d( = H{0,w). Hence 



m=0 

oo 



(m+l)C 



1 r 

m=l 



oo ^ 

^-^ m 



GiCw)"^dC 



w) 



(2.14) 



m=l 



This proves the fundamental formula (11.71) / (12. 2p . in which the sum is absolutely 
convergent on V, locally uniformly on V. 

It follows from this formula [taking H{z, w) = z] that if F C C^^ and G is analytic, 
then if is analytic as well; and if also H is analytic, then so is w i— > H{ip{w), w). 

Finally, if is a polydisc (or more generally a complete Reinhardt domain) cen- 
tered at G C^, and G and H are analytic, then the analytic function H{(p{w),w) 
is given in by a convergent Taylor series. The coefficients of this Taylor series are 
given by ( 12. 5p / ( 12. 6p because the absolutely convergent function series (12. 2p / ( 12. 3p can 
be differentiated term-by-term. □ 

Remark 2.5. The following alternative calculation provides a slightly slicker proof 
of Theorem 12. 3t start from (12. 8p and write 



1-f (C^) 
C-GiC,w) 



d 



H{C,w)—\og[C-G{C, 



w 



H{C,w) 



1 d , 

7 + 7^ log 



G{C,w) 
C 



(2.15) 



where hypothesis (c) and Taylor expansion guarantee that the function log[l— G(C, w) / (] 
is well-defined and single- valued on the circle \(\ = p; furthermore, a simple compact- 
ness argument extends this to some annulus pi < \(\ < p (locally uniformly in w). 
Now, the first term in (12.151) . when integrated, yields H{0,w). To handle the second 
term, let us integrate by parts, Taylor-expand the log, and then extract the residue 
from the resulting Laurent series: we get 



--^//'(C.uOlo, 



1 - 



c 



m=l ^ ^ 

j2-[r-']H'{c,w)G{c,w) 



(2.16) 



m=l 



which is (11.71) / (12.20 . A similar argument without integration by parts yields the al- 
ternate formula (ll.Sp / (12. 3p . I thank Alex Eremenko for helpful comments concerning 
this proof. □ 
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Remark 2.6. Formula (11.71) / (12.21) can alternatively be deduced from the standard 
Lagrange inversion formula (11. 4p by an argument due to Ira Gessel [42, p. 148]: 
Introduce a new parameter t G C, and study the equation z = tG{z, w) with solution 
z = ^{w,t). Applying (11.41) with w fixed and t as the variable, we obtain 

Hmw,t),w) = HiO,w) + f;^[C-i]^^^ G(C, (2.17) 

^ — ^ m oC 

m=l 

Setting t = 1 yields (11.71) / (12. 2p . [Hypothesis (c) guarantees that sup \G{C,w)/C\ = 

\C\=P 

C < 1, so that the Lagrange series is convergent for |t| < l/C] An analogous 
argument starting from (II. 5p yields the alternate formula (11.80 / (12. 3p . □ 

Remark 2.7. For some purposes the disc Dp can be replaced an arbitrary domain 
(connected open set) D G C Hypothesis (c) is then replaced by the assumption that 
the image G{D,w) is relatively compact in D for all w eV. Under this hypothesis, 
the map z \—>- G{z, w) is a strict contraction in the Poincare metric [26] on G{D, w), i.e. 
satisfies dpom{G{zi,w), G{z2,w)) < Kdpoin{zi, Z2) for some n < 1 (locally uniformly 
in w). It then follows from the contraction- mapping fixed-point theorem that there 
is a unique fixed point ^p{w), which moreover can be obtained by iteration starting 
at any point of D. That is, if we define 

GoiCw) = C (2.18a) 

G„+i(C,w) = G{G^r.{(,w),w) (2.18b) 

then Gn{C,w) uniformly for ( G D and locally uniformly for w G V. It 

would be interesting to know whether this can be used to provide a function series 
analogous to (ll.6p - (ll.8p based on the Taylor coefficients of G at an arbitrary point 
(0 E D. An analogous argument works for domains D C C^, using the Kobayashi 
metric [22,24-26,30], provided that some iterate G'^{D,w) is Kobayashi-hyperbolic. 
I thank Alex Eremenko for suggesting the use of the Poincare and Kobayashi metrics. 
□ 

Question 2.8. Can Theorem 12.31 be generalized to allow H{z,w) to have a pole 
at 2; = 0? Please note that by linearity it suffices to consider H{z,w) = h{w)z~^ 
(A; > 1); and since h{w) just acts as an overall prefactor, it suffices to consider simply 
H{z, w) = z~^ . Of course we will somehow have to restrict attention to the subset of 
V where '^{w) 7^ 0; and some hypothesis will be needed to guarantee that this subset 
is nonempty, i.e. that ip is not identically zero. □ 



3 Implicit function formula: Formal-power-series 
version 

In this section we shall consider G{z,w) to be a formal power series in inde- 
terminates z and w = {wi)i^j, where / is an arbitrary finite or infinite index set. 
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The coefficients in this formal power series may belong to an arbitrary commutative 
ring-with-identity-element R. (For some purposes we will want to assume further 
that the coefficient ring R contains the rationals as a subring. In applications, R 
will usually be a field of characteristic — e.g. the rationals Q, the reals M, or the 
complex numbers C — or a ring of polynomials or formal power series over such a 
field.) See [8,16,18,21,36] for basic facts about formal power series. We recall that 
-R[[w]] denotes the ring of formal power series in the indeterminates w = {wi)i^i with 
coefficients in R. 

We begin with a well-known proposition asserting the existence and uniqueness 
of the formal power series ip{w) solving the equation z = G{z,w), or equivalently 
F{z,w) = 0. Here R can be an arbitrary commutative ring with identity element; it 
need not contain the rationals or even be of characteristic 0. (For instance, the ring 
Z„ of integers modulo n is allowed.) 

Proposition 3.1 (Implicit function theorem for formal power series). 

(a) Let R be a commutative ring with identity element. Let F{z,w) be a formal 
power series in indeterminates z and w = {wi)i^j, with coefficients in R; suppose 
further that F{0, 0) = and that {dF/dz){0, 0) is invertible in the ring R. Then 
there exists a unique formal power series ip{w) with zero constant term satisfying 
F{^{w),w) = 0. 

(b) Let R be a commutative ring with identity element. Let G{z,w) be a formal 
power series in indeterminates z and w = {wi)i(zi, with coefficients in R; suppose 
further that G(0,0) = and that 1 — {dG/dz){0,0) is invertible in the ring R. 
Then there exists a unique formal power series (p{w) with zero constant term 
satisfying (p{w) = G{ip{w),w) . 

If the ring i? is a field, then the hypothesis that {dF/dz){0,0) be invertible in 
R means simply that {dF/dz){0,0) 7^ 0. If i? is a ring of formal power series over 
a field, then this hypothesis means that the constant term of {dF/dz){0,0) is 7^ 0. 
Analogous statements apply to part (b), with {dG/dz){0,0) 7^ 1. 

Proof of Proposition 13.11 It suffices to prove either (a) or (b), since they are 
equivalent under the substitution F{z,w) = z — G{z,w). We shall prove (b). Let us 
write 

00 

Giz,w) = Y,gk{w)z' (3.1) 

with each gi^ G by hypothesis go{0) = 0, and 1 — gi{0) is invertible in the ring 

R. The equation ip{w) = G{ip{w),w) can now be written as 

00 

^(w) = J^dkMviw)'' (3.2) 

k=0 



or equivalently 

ip[w) = [1- gi 



9o{w) + [gi{w) - gi{0)] ip{w) + ^gk{w)ip{wY 



k=2 



(3.3) 
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Since the series go{w), giiw) — 5'i(0) and ifiw) all have zero constant term, we see 
from (13. 3p that the coefficients in ^{w) = X]|ci|>i ^ct""^" '^^^ uniquely determined 
by induction on \(x\. Conversely, the unique solution of this system of equations 
necessarily solves (p{w) = G{(p{w),w). □ 



A multidimensional generalization of Proposition 13.11 in which z is replaced by a 
vector of indeterminates (zi, . . . ,zn), can be found in Bourbaki [5, p. A.IV.37]. As 
one might expect, the hypothesis is that the Jacobian determinant det{dF/dz){0,0) 
is invertible in R. 

We can carry this argument further and provide an explicit formula for (p{w). Let 
us begin with what appears to be a special case, but in fact contains the general result: 
namely, let us take G{z,g) = ^'^^qQuZ'^ where g = {gn)'^=o are indeterminates. We 
then have the following "universal" version of the Lagrange inversion formula [13, 
Theorem 6.2]: 

Proposition 3.2 (Universal Lagrange inversion formula). Let g = {gn)'^=Q be inde- 
terminates. There is a unique formal power series (f G '^[[g]] with zero constant term 

oo 

satisfying (p{g) = Yl 9nVi.9Y ; ^'^^ ^^■^ coefficients are given explicitly by 



n=0 



( ^ CO X 

( E^n - 1)! 

f, ^n=0 ^ 



[9l'A'g'i'---M9) 



n=0 



otherwise 



for all integers i > 1. In particular, for each pair (feg, ^i) there are finitely many 
{k2, ^3, . . .) for which p.^j ) is nonzero, so that ip{gY is a formal power series in go, gi 
whose coefficients are polynomials in g2,g3, ■ ■ ■■' that is, ip G Z[g2,g3, . . •] [[(^o? fi'i]] ■ 
We can also write the formula 

oo / oo \ m-l / go \ 

VigY = Etc'""'] [Y.9nC - ""^dnC , (3.5) 

m=l \n=0 / \n=0 / 

and in the ring Q[[g]] we can write 

oo /ex. \- 

^^aY = j:^ir-'][j:9nc] . (3.6) 

m=l \n=0 / 



Proof. The functional equation ip{g) = Yl'^=Q9n^{gY is the same equation as is 
satisfied by the (ordinary) generating function for unlabeled plane trees (i.e., rooted 
trees in which the vertices are unlabeled but the subtrees at each vertex are linearly 
ordered), in which a vertex having n children gets a weight and the weight of a tree 
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Figure 1: The functional equation ^p{g) = 'Yl'^=o9nV{9)^ for the generating function 
of unlabeled plane trees. 



is the product of its vertex weights (see Figure [T])Jj Since the solution of this equation 
is unique by Proposition 13.1( b). it follows that v'(sr) is the generating function for 
unlabeled plane trees with this weighting. More generally, ^{gY is the generating 
function for unlabeled plane forests with (i components, with the same weighting It 
is a well-known (though nontrivial) combinatorial fact [42, Theorem 5.3.10] that the 
number of unlabeled plane forests with I components having type sequence (/cq, fci, . . .) 
[i.e., in which there are kn vertices having n children, for each n > 0] is given precisely 
by dH. 

Since the constraint in (13. 4p can be written as /cq = ^ + ^1^=2^'"^ ~ l)^n, one sees 
immediately that for each pair (/cq, /ci) there are finitely many {k2, k^, ■ ■ ■) for which 
(13. 4p is nonzero. 

To prove (13. 5p . let us expand out the summand on the right-hand side, choosing 
n = N in the last factor: we get 

/ oo \ m-l / oo 

\n=0 / \n=0 



ko,ki,k2,...>0 
J2 kn=m 



(3.7) 



oo 



TTl — 1 \ -,— w 

where I J = (m — 1)! /ctv/ H ^n' is a multinomial co- 



^ The key fact here is that if S is any sum of terms, then a term in the expansion of E" — EE • • • E 
is obtained by choosing, in order, a term of E for the first factor, a term of E for the second factor, 
etc. This is why one obtains trees in which the subtrees at each vertex are hnearly ordered. 

^ An unlabeled plane forest is a forest of rooted trees with unlabeled vertices in which the subtrees 
at each vertex are linearly ordered and the components of the forest (or equivalently their roots) 
are also linearly ordered. The reasoning in the preceding footnote explains why (p{gY gives rise to 
forests in which the components are linearly ordered. 
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efficient. The constraints tell us that kn = m and Xl^o'^^" = m — £, so that 

E^=o(l -N)kN = i. This is precisely f lfl) . 

If work over the rationals rather than the integers, things become slightly easier: 
expanding out the summand in (13.61) . we obtain 

^ir-'i(i:.„c»)"'^^r-'] E G,,I....) (n«^")c^*. 

\n=0 / fco,fci,fc2,.->0 ^ ^ \n=0 / 

J2 k„=m 

(3.8) 

which again matches (13. 4p . □ 

Remark 3.3. To see easily (i.e., without the full combinatorial interpretation) that 
the numbers (13.41) are indeed integers, it suffices to note that 

n ■ ■ ■ , ki-i, ki — 1, ki^i, . . . 

is a multinomial coefficient and hence an integer, and that i = Yl'iZoi^~'^)ki ^ly virtue 
of the constraint. I thank Richard Stanley and Ira Gessel (independently) for this 
observation. □ 

Remark 3.4. Let us stress that the proof of the counting result given in [42, Theo- 
rem 5.3.10] is purely combinatorial; it is based on a bijection between unlabeled plane 
forests and a certain class of words on a finite alphabet. We shall use it to deduce the 
implicit function formula (and in particular the Lagrange inversion formula) as a sim- 
ple corollary, following the second proof of [42, Theorem 5.4.2]. This approach to the 
Lagrange inversion formula goes back to Raney [37] and was later simplified by other 
authors [33,38]. On the other hand, a much easier (though perhaps less enlightening) 
way of obtaining this counting result is to first prove the Lagrange inversion formula 
(e.g. by the algebraic argument given in the ffist proof of [42, Theorem 5.4.2]) and 
then use it to obtain the enumeration of plane trees or forests as a straightforward 
application [45] [16, section 2.7.7]. □ 

We can now deduce the general implicit function formula as an easy corollary 
of Proposition 13.21 The key point is that (p{gY is a formal power series in go,gi 
whose coefficients are polynomials in g2,g3,...: therefore, in the identity (p{g) = 
Yl'^=o dnfig)"" we can make the substitutions gn ^ gniw), where the gniw) are formal 
power series in an arbitrary collection w = {wi)i^i of indeterminates, provided that 
(7o(w) and giiw) have zero constant term; the series gniw) for n > 2 are unrestricted. 
This yields a formal-power-series version of (II. 8p : 

Theorem 3.5 (Implicit function formula — formal-power-series version 7^1). Let R 
he a commutative ring with identity element. Let G{z, w) = Yl'^=o 9n{w) z"' be a formal 
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power series in indeterminates z and w = (wj)ig/, with coefficients in R, satisfying 
G{0,0) = and {dG/dz){0,0) = [i.e., go{0) = and ^i(O) = 0/. Then the unique 
formal power series (p{w) with zero constant term satisfying f{w) = G{ip{w),w) is 
given explicitly by 



(p{w) 



^ Ci{ko,ki,k2,...)Ylgr. 

fco,fci,fc2,...>0 



w 



(3.10) 



n=0 



for all integers i > 1, where Ci{ko, ki^ k2, ■ ■ ■) is given by (3^). Its coefficients are 
given by the finite sums 



2 ct 



m=l 



(3.11) 



More generally, if H{z, w) is any formal power series, we have 
[w'^]H{ip{w),w) 

dOiCw) 



\w 



]H{o,w) + Y.irw'WiCw) G(c, 



w) 



m=l 



dC 



G{C,w 



.m— 1 



(3.12) 



Proof. As just observed, substituting gn gniw) in (13. 4p proves (I3.10p . 

To prove (I3.12p . suppose first that H{z, w) = z^. If £ = 0, then obviously ip{w) 
1. If £ > 1, we can substitute gn ^ gn{w) in (13. 5p to obtain 



ip[w) 



m=l 



wY' - C^^^^G'(C,«;)'"-' 



(3.13) 



The general case of H{z,w) = Yl'iLohe{w)z^ is obtained from these special cases by 
linearity: we have 



H{^{w),w) = H{0,w) + J2inH{C,w)\G{C, 



771=1 



(3.14) 

Now take the coefficient of ly" on both sides, and observe that contributions come 
only from m < 2\ck.\ (see Remark 12.41) . □ 

The implicit function formulae given in Theorem 13.51 — in particular, the variant 
Lagrange formula (13. lip / (I3.12p — are valid in an arbitrary commutative ring-with- 
identity-element R, even if R is not of characteristic 0, because the numerical coef- 
ficients arising in (l3.1UI) - (l3.12p are all integers. On the other hand, if we are willing 
to assume that the ring R contains the rationals as a subring, then we can deduce 
the slightly more convenient explicit formulae (11.61) / (I1.7P for f{w). An argument 
completely analogous to that leading to (13.120 . but based on (13.61) instead of (13. Sp . 
proves: 
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Theorem 3.6 (Implicit function formula — formal-power-series version #2). Let R 
be a commutative ring containing the rationals as a suhring. Let G{z,w) be a formal 
power series in indeterminates z and w = (wj)ig/, with coefficients in R, satisfying 
(7(0,0) = and {dG/dz){0,0) = 0. Then there exists a unique formal power series 
(p{w) with zero constant term satisfying ip{w) = G{(p{w),w) , and its coefficients are 
given by the finite sums 



[w'^Mw) = -ir-'^'^Mcwr. (3.15) 

^ — ^ m 

m=l 

More generally, if H{z, w) is any formal power series, we have 

[w'^]H{^{w),w) = [w'^]H{0,w)+ ^ 1 ^^^G(C,^)™. (3.16) 

m Cc, 



m=l 



It is instructive to give two alternate proofs of Theorem 13.61 one deducing the 
result as a corollary of the usual Lagrange inversion formula for formal power se- 
ries, and one deducing it from our analytic version of the implicit function formula 
(Theorem 12.31) . 

Second Proof of Theorem 13.61 Proposition [31T b) gives the existence and 
uniqueness of (p{w). [The proof of fl3.15p / fl3.16p to be given next will provide an 
alternate proof of uniqueness.] Now we deduce (13.161) by using the formal-power-series 
version of Gessel's argument [42, p. 148] mentioned in Remark 12.61 Introduce a new 
indeterminate t, and study the equation z = tG{z, w) with solution z = $(w, t). Using 
the Lagrange inversion formula (11.41) for formal power series in the indeterminate t 
and with coefficients in the ring i?[[w]], we obtaiiij 

Hmw,t),w) = HiO,w) + J2^-l[r-']^E!^GiCwr (3.17) 
and hence 

[w'^]Hmw,t),w) = [w'^]H{o,w) + f2-[r-'w'^]^^^^i^G{c,wr (s.is) 



m=l 



as equalities between formal power series in t. But by the hypothesis on G, the 
only nonzero contributions to the sum on the right-hand side of fl3.18p come from 
m < 2\a.\ — 1 (see Remark 12.40 . so each side of (13.180 is in fact a polynomial in t (of 
degree at most 2|q;| — 1). So we can evaluate it at any chosen t E R, in particular at 
t = 1. This proves (I3.16p . □ 



* The Lagrange inversion formula for formal power series is most commonly stated for series 
with coefficients in a field of characteristic [42, Theorem 5.4.2], but in the form (|1.3p / (|1.4|) it 
also holds for series with coefficients in an arbitrary commutative ring containing the rationals. See 
e.g. [16, Theorem 1.2.4]. 
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Third Proof of Theorem 13.61 We see from the proof of Proposition [O that 
(p{wY will be given by a universal formula of the form fl3.10p with nonnegative integer 
coefficients q(/co, ^i, ^2, • • •)) remains only to find these coefficients. To do this, it 
suffices to consider the case R = C with a single indeterminate It furthermore 
suffices to consider the cases in which G{z, w) is a polynomial in z and w (of arbi- 
trarily high degree), since [w^]ip{w) depends only on the [z^w^]G{z,w) with j, k <n. 
But in this case we can apply the analytic version of the implicit function formula 
(Theorem [23!) • ^ 



Question 3.7. Can Theorems 13.51 and 13.61 be generalized to allow H{z,w) to be a 
Laurent series in z, at least when w is a single indeterminate and 601 = {dG/dw){0, 0) 
is invertible in the ring i?? (Or slightly more restrictively, when i? is a field of 
characteristic zero and 601 7^ 0?) Please note that by linearity it suffices to consider 
H{z,w) = h{w)z~^ {k > 1); and since h{w) just acts as an overall prefactor, it 
suffices to consider simply H{z,w) = z~^. One might try imitating [42, first proof of 
Theorem 5.4.2], possibly combined with the Gessel idea z = tG{z,w). □ 



In Theorems l3.5l and l3.6t we have for simplicity assumed that 610 = {dG/dz){0, 0) = 
gi{0) is zero. This seems more restrictive than Proposition 13. where it was assumed 
only that 1 — 610 is invertible in the ring R, but there is in fact no real loss of gener- 
ality here. For if 1 — 610 is invertible, then the equation z = G{z,w) is equivalent to 
z = G{z, w), where 

Giz,w) = il-b,o)-'[G{z,w)-b,oz] (3.19) 
satisfies {dG/dz){0,0) = 0. We can therefore apply Theorems 13.51 and 13.61 with 

5„W = |l-9.(0r x|*f) (3.20) 

I 9i{w) -9i{0) if n = 1 J 

On the other hand, if i? = M or C and |6io| < 1, we can avoid this preliminary 
transformation if we prefer: the power-series coefficients will then be given by abso- 
lutely convergent infinite sums, which are nothing other than the finite sums based 

^ CO / -\- k l\ 

on G in which each factor (1 — 610) ^"^ has been expanded out as ^ I 16' 



fc=0 

We therefore have: 



k 



ho- 



Theorem 3.8 (Implicit function formula — formal-power-series version #3). Let R 

he either the field M of real numbers or the field C of complex numbers. Let G{z,w) 
be a formal power series in indeterminates z and w = {wi)i^i, with coefficients in R, 
satisfying (7(0,0) = and \{dG / dz){Q,Q)\ < 1. Then there exists a unique formal 



^ Here it is important that C is of characteristic 0, in order to avoid losing information about the 
integers Q(fco, ki,k2, ■ ■ ■)■ 
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power series <f{w) with zero constant term satisfying ip{w) = G{(f{w),w), and its 
coefficients are given by the absolutely convergent sums 

oo 

[w^Mw) = Y,- ir-'w^m, wr . (3.21) 

^ — ^ m 

m=l 

More generally, if H{z,w) is any formal power series, we have 

K]i/(^H,^) = [w"']H{0,w) + f^l[C~i^"]^^^G(C,«;r. (3.22) 



m=l 



4 Some examples 

Example 4.1. Let G{z,w) = a{w)z + f3{w) with ^(0) = 0. Then ([L6]) tells us that 

oo ^ oo 

^{w) = J2-[r~']HwK + f3{w)r = 5^«(«;)'"-i/3(^), (4.1) 



m 

m=l m=l 



which sums to the correct answer P{w)/[1 — a{w)] provided that |a(i(;)| < 1. So some 
condition like \{dG/dz){0,0)\ < 1 is needed in order to ensure convergence of the 
series (11.61) . □ 

Example 4.2. The inverse-function problem f{z) = w with f{z) = Yl'^=i (^nZ^ (^i 7^ 
0) can be written in the form z = G{z, w) in a variety of different ways. The most 
obvious choice is 

Giz,w) = J^^^ (4-2) 
which leads to the usual form of the Lagrange inversion formula: 

°° in / 00 \ —m 

r\w) = Y.—<'' [C"-'] 1 + E -c""' • (4-3) 



m \ — ' ai 

m=l ^ n=2 



An alternative choice, proposed by Yuzhakov [49], is 



G(z,u;) = ^-M^, (4.4) 
ai ai 



which leads to 

00 _rr, m 



"o —m / \ / 00 V i 

m=l £=0 ^ ^ ^n=2 ^ 



After some straightforward algebra involving binomial coefficients, both forms can be 
shown to yield the same result: 

m=l fc2,fc3,...>0 n=2 

'^{n-l)kn=m-l 
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(see also [11]). However, (14. 3p / (14.61) expands f~^{w) as a power series in w, while 
(14. 5 p expands f~^{w) as a series in a different set of polynomials in w. 

Consider, for instance, f{z) = ze~^. Then the usual Lagrange inversion formula 
(II. 3p gives 

r\w) = E"^""'^' (4.7) 

m=l 

which is well known [42, Propositions 5.3.1 and 5.3.2] to be the exponential generating 
function for rooted trees (i.e., there are m^~^ distinct rooted trees on m labeled 
vertices). On the other hand, Yuzhakov's version (14. 4p / (14. 5p gives, after a short 
calculation, the alternate representation 

oo 

r\w) = J^Pmiw) (4.8) 

where 



m=l 



puw) = i-ir-' E 7T^^ I ' " U -^ (4-9) 

^ ^ ^ ^ ^ k\(k-l)\ \m- k\ ' ^ ^ 

k=\{m+l)/2] V y I. J 

here {^} denotes the Stirling subset numbers (also known as Stirling numbers of 
the second kind), i.e. the number of partitions of an n-element set into k nonempty 
blocks [17] I wonder whether the coefficients in Pm{w) have any combinatorial 
meaning. □ 

Example 4.3. Here is an application from my own current research [41]. The function 

oo „ 

F{x,w) = (4.10) 



n=0 



arises in enumerative combinatorics in the generating function for the Tutte polyno- 
mials of the complete graphs Kn [40, 46] and in statistical mechanics as the grand 
partition function of a single-site lattice gas with fugacity x and two-particle Boltz- 
mann weight w [39]. Let us consider F as a function of complex variables x and w 
satisfying \w\ < 1: it is jointly analytic on C x D and jointly continuous on C x D 
(here D and D denote the open and closed unit discs in C, respectively), and it is an 
entire function of x for every w G ©. 

When w = 0, we have F{x, 0) = 1 + x, which has a simple zero at a; = —1. One 
therefore expects — and can easily prove using Rouche's theorem — that for small \ w\ 



^° The key step in the derivation of (|4.8p / (|4.9p is the weU-known identity [17, eq. (7.49)] 
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there is a unique root of F{x, w) near x = —1, which can be expanded in a convergent 
power series 



oo 



Xo{w) = -1 - 5^a„i/;V (4.11) 

n=l 

The coefficients {a„} can of course be computed by substituting the series (14.111) into 
(I4.10p and equating term-by-term to zero; but a more efficient method is to use the 
imphcit function formula (11.61) . It suffices to set x = — 1 — z and define 



oo 



G{z,w) = (4.12) 



n=2 



We then obtain 



+ fi-^ + ii-" + + + • • • • (4-13) 

I conjecture — but have thus far been unable to prove — that all the coefficients in 
this power series are nonnegative. Since it is known [31,32,35] that Xo{w) is analytic 
in a complex neighborhood of the real interval < w < 1, this conjecture would 
imply, by the Vivanti-Pringsheim theorem [20, Theorem 5.7.1], that xo{w) is in fact 
analytic in the whole disc \w\ < 1, i.e. that the series (I4.13P has radius of convergence 
exactly 1. (It is not hard to show that Xq{w) — — oo as w | 1, so that the radius of 
convergence cannot be bigger than 1.) 

Using (II. 7p we can also compute power series for functions ofxo{w). For instance, 
we have 

\og[-xoiw)] = Iw + Iw^ + + + + §^w^ + ^w' + 

^ 20736 ^ 46080 ^ 84480 ^ 3317760 T ■ ■ ■ ■ V^--^^^ 

I conjecture that all the coefficients in (I4.14p are nonnegative. By exponentiation this 
implies the preceding conjecture, but is stronger. We also have 

-- 1 - Iw - - ^w"" - ^w^ - ^w'' - ^w"^ - ^w' - j^W^ 

_ _ ^y^lO _ ^11 _ _ .4 

6912 23040 4608 92160 K^'-^^J 



Xo{W) 



I conjecture that all the coefficients in (14.151) after the constant term are nonpositive. 
This implies the preceding two conjectures, but is even stronger. Using Mathemat- 
ICA I have verified all three conjectures through order w^^. Indeed, by exploiting 
the connection between F{x,w) and the generating polynomials C„(f) of connected 
graphs on n labeled vertices [40,41,46] — or equivalently the inversion enumerator 
for trees, /„(w) [42, Exercise 5.48, pp. 93-94 and 139-140] — I have computed the 
series Xo{w) and verified these conjectures through order w'^'^^. 

The relative simplicity of the coefficients in (I4.15P compared to (14.130 / (14.140 sug- 
gests that —1/xq{w) may have a simpler combinatorial interpretation than —xo{w) or 
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log[— a;o(w)]. Please note also that since xq[w) —oo as w ^ 1, the coefficients {6„} 
in —1/xq{w) = 1 — Yl'^=i ^nW"" — if indeed they are nonnegative — add up to 1, so 
they are the probabilities for a positive-integer- valued random variable. What might 
such a random variable be? Could this approach be used to prove the nonnegativity 
of {64? 

Please note also that the coefficients {a„} and satisfy the discrete-time re- 
newal equation 

n— 1 

On = ^hkan-k ■ (4.16) 

fc=l 

Therefore, if the {6„} are nonnegative, they can be interpreted [9, Chapter Xlll] as the 
probability distribution for ffist occurrences (or equivalently for waiting times between 
successive occurrences) of a recurrent event while the {a„} are the probabilities of 
occurrence tout court: 

ttn = P(i^ occurs at the nth. trial) (4.17a) 
hn = F(^ occurs for the ffist time at the nth trial) (4.17b) 

What might such a family of recurrent events be? 

For what it's worth, if we define ctv = 1 — ^^=i 6n, we find empirically (at least 
up to = 775) that that c'jy = 2 A*"! cat is an integer, with 

c'l, . . . , = 1, 1, 2, 5, 20, 85, 490, 3185, 23520, 199605, 1901130, 
19767825, 223783560, 2806408605, 37447860450, 540137222625, 
8284392916800, 135996789453525, 2363554355812650, 43437044503677825 

(4.18) 

Can anyone figure out a combinatorial interpretation of these numbers? 

It is also known [41] that —xq{w) diverges as w t 1 with leading term e~^(l — w)"^, 
which suggests that we have lim„^oo a„ = and Yl^=i '^^n — e- If we define 
djsf = X]n=i = X]rt=i[V("' ^ 1)' ^ nbn], wc find empirically (at least up 

to = 775) that = 2 (AT - 1)! div and e'^ = 2 (A^ - 1)! ejv are integers, with 

4,...,4o = 1> 2, 5, 18, 77, 420, 2625, 19110, 158025, 1457820, 14872725, 
166645710, 2032946685, 26754868140, 379216422585, 5747274883350, 
92854338001425, 1591646029073100, 28870013167120125, 552364292787857550 

(4.19) 

e[, . . . , e^o = 1> 2, 5, 14, 53, 232, 1289, 8290, 61177, 515000, 4855477, 
50364514, 571176005, 7098726832, 94733907025, 1361980060802, 
20893741105009, 342071315736280, 5936899039448717, 108967039136950450 

(4.20) 

Might these numbers have some combinatorial interpretation? □ 
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5 Possible multidimensional extensions 



In this paper I have restricted attention to the imphcit-function problem in one 
complex variable (i.e., 2; G C though w lies in an arbitrary space W). Yuzhakov 
[2,49,50] goes much farther: he gives a beautiful explicit formula for the solution 
of the multidimensional implicit-function problem F{z,w) = with z G C^, w G 
C"^ and F: x C^^ under the usual hypothesis that the linear operator 

{dF/dz){0,0) is nonsingular. I suspect that the approach of the present paper can 
likewise be extended to the corresponding multidimensional situation — that is, z = 
G{z,w) with z e C^, w e W and G: x W — under the hypothesis that 

the linear operator {dG/dz){0,wo) has spectral radius < 1. Indeed, such a result 
presumably holds when is replaced by a complex Banach space. As we have seen, 
the proof of Theorem 12.31 given here applies verbatim when w lies in an arbitrary 
space, since the variables w simply "go for the ride". But multidimensional z is a 
genuine generalization; and for lack of time and competence, I have not attempted 
to pursue it. See [1, 3, 6, 10, 14, 15, 19, 28, 29] for information on multidimensional 
Lagrange inversion formulae, and [27] for a survey of implicit function theorems. 
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